Protein tyrosine phosphatase PTP20 and related products and methods

ABSTRACT

The present invention relates to a novel polypeptide, PTP20, and to nucleic acid molecules encoding the polypeptide. The invention also relates to nucleic acid molecules encoding portions of the phosphatase, nucleic acid vectors containing PTP20 related nucleic acid molecules, recombinant cells containing such nucleic acid vectors, polypeptides purified from such recombinant cells, antibodies to such polypeptides, and methods of identifying compounds that bind PTP20 or abrogate its interactions with natural binding partners. Also disclosed are methods for diagnosing abnormal conditions in an organism with PTP20 related molecules or compounds.

RELATED APPLICATIONS

This application is related to the U.S. Provisional Patent Application Ser. No. 60/030,860, by Aoki, et al., entitled "Protein Tyrosine Phospahatase PTP20 and Related Products and Methods," filed Nov. 13, 1996, and PCT Application No. PCT/1897/00946, filed Jun. 17, 1997, both of which are incorporated herein by reference in their entirety including any drawings.

INTRODUCTION

The present invention relates generally to a newly identified protein tyrosine phosphatase and related products and methods.

BACKGROUND OF THE INVENTION

The following description of the background of the invention is provided to aid in understanding the invention, but is not admitted to describe or constitute prior art to the invention.

Cellular signal transduction is a fundamental mechanism whereby extracellular stimuli are relayed to the interior of cells and subsequently regulate diverse cellular processes. One of the key biochemical mechanisms of signal transduction involves the reversible phosphorylation of proteins. Phosphorylation of polypeptides regulates the activity of mature proteins by altering their structure and function. Phosphate most often resides on the hydroxyl moiety (--OH) of serine, threonine, or tyrosine amino acids in proteins.

Enzymes that mediate phosphorylation of cellular effectors generally fall into two classes. The first class consists of protein kinases which transfer a phosphate moiety from adenosine triphosphate to protein substrates. The second class consists of protein phosphatases which hydrolyze phosphate moieties from phosphoryl protein substrates. The converse functions of protein kinases and protein phosphatases balance and regulate the flow of signals in signal transduction processes.

Protein kinases and protein phosphatases are generally divided into two groups--receptor and non-receptor type proteins. Most receptor-type protein tyrosine phosphatases contain two conserved catalytic domains, each of which encompasses a segment of 240 amino acid residues. Saito et al., 1991, Cell Growth and Diff. 2:59-65. Receptor protein tyrosine phosphatases can be subclassified further based upon the amino acid sequence diversity of their extracellular domains. Saito et al., supra; Krueger et al., 1992, Proc. Natl. Acad. Sci. USA 89:7417-7421.

Protein kinases and protein phosphatases are also typically divided into three classes based upon the amino acids they act upon. Some catalyze the addition or hydrolysis of phosphate on serine or threonine only, some catalyze the addition or hydrolysis of phosphate on tyrosine only, and some catalyze the addition or hydrolysis of phosphate on serine, threonine, and tyrosine.

Tyrosine phosphatases can down-regulate the catalytic activity of protein kinases involved in cell proliferation and are therefore thought to be possible anti-cancer proteins. Protein phosphatases with inappropriate activity are also involved in some types of cancer. Because abnormally elevated levels of cell proliferation are associated with receptor and non-receptor protein kinases with unregulated activity, protein phosphatase-catalyzed dephosphorylation of a protein kinase can down-regulate kinase activity and thereby decrease the rate of cell proliferation.

In addition to their role in cellular proliferation, protein phosphatases are thought to be involved in cellular differentiation processes. Cell differentiation occurs in some cells upon nerve growth factor (NGF) or epidermal growth factor (EGF) stimulation. Cellular differentiation is characterized by rapid membrane ruffling, cell flattening, and increases in cell adhesion. Chao, 1992, Cell 68:995-997.

Alignment of primary amino acid sequences of known PTPs shows that their catalytic domains share common amino acid sequences. This observation has facilitated efforts of cloning protein phosphatases from multiple organisms and tissues. Probing cDNA libraries with polynucleotides complementary to cDNA encoding protein phosphatase consensus sequences has identified cDNAs resembling protein phosphatase sequences via the polymerase chain reaction (PCR). Some polypeptide molecules encoded by these cDNAs have tyrosine phosphatase activity.

SUMMARY OF THE INVENTION

The present invention relates to nucleic acid molecules encoding a newly identified protein tyrosine phosphatase named PTP20, nucleic acid molecules encoding portions of the full length protein, nucleic acid vectors harboring such nucleic acid molecules, cells containing such nucleic acid vectors, purified polypeptides encoded by such nucleic acid molecules, antibodies to such proteins and polypeptides, and methods of identifying compounds that bind PTP20 or abrogate its interactions with natural binding partners. Also disclosed are methods for diagnosing abnormal conditions in an organism with PTP20 related molecules or compounds. The nucleic acid molecules, nucleic acid vectors, host cells, polypeptides, and antibodies may be produced using the information provided herein in conjunction with well known and standard techniques used currently in the art.

The present invention is based in part upon the isolation and characterization of nucleic acid molecules encoding a novel protein phosphatase designated PTP20. PTP20 regulates growth factor stimulation of cellular differentiation. PTP20 is thought to be involved in cellular differentiation, as its over-expression in rat pheochromocytoma cells (PC12) is associated with increased rates of differentiation. Various treatments of neural cancers as well as neural damage are thus provided based on the discovery of PTP20 and its role in these disorders.

Thus in a first aspect, the invention features an isolated, enriched, or purified nucleic acid molecule encoding a PTP20 polypeptide.

The term "isolated", in reference to nucleic acid molecules, indicates that a naturally occurring sequence has been removed from its normal cellular environment. Thus, the sequence may be in a cell-free solution or placed in a different cellular environment. The term does not imply that the sequence is the only nucleotide chain present, but that it is essentially free (about 90-95% pure at least) of non-nucleotide material such as chromosomal DNA or proteins.

The term "enriched", in reference to nucleic acid molecules, means that the specific DNA or RNA sequence constitutes a significantly higher fraction (2-5 fold) of the total DNA or RNA present in the cells or solution of interest than in normal or diseased cells or in the cells from which the sequence was taken. A person skilled in the art could enrich a nucleic acid mixture by preferentially reducing the amount of other DNA or RNA present, or preferentially increasing the amount of the specific DNA or RNA, or both. However, nucleic acid molecule enrichment does not imply that there is no other DNA or RNA present, the term only indicates that the relative amount of the sequence of interest has been significantly increased. The term "significantly" qualifies "increased" to indicate that the level of increase is useful to the person performing the recombinant DNA technique, and generally means an increase relative to other nucleic acids of at least 2 fold, or more preferably at least 5 to 10 fold or more. The term also does not imply that there is no DNA or RNA from other sources. Other DNA may, for example, comprise DNA from a yeast or bacterial genome, or a cloning vector. In addition, levels of mRNA may be naturally increased relative to other species of mRNA when working with viral infection or tumor growth techniques. The term "enriched" is meant to cover only those situations in which a person has intervened to elevate the proportion of the desired nucleic acid.

Most methods of recombinant nucleic acid manipulation require that these molecules are in a purified form. The term "purified", in reference to nucleic acid molecules does not require absolute purity (such as a homogeneous preparation); instead, it represents an indication that the sequence is relatively more pure than in its cellular environment (compared to the natural level this level should be at least 2-5 fold greater, e.g., in terms of mg/ml). The claimed DNA molecules obtained from clones could be obtained directly from total DNA or from total RNA. cDNA clones are not naturally occurring, but rather are preferably obtained via manipulation of a partially purified, naturally occurring substance (messenger RNA). The construction of a cDNA library from mRNA involves the creation of a synthetic substance (cDNA). Individual cDNA clones can be isolated from the synthetic library by clonal selection of the cells carrying the cDNA library. Thus, the process which includes the construction of a cDNA library from mRNA and isolation of distinct cDNA clones yields an approximately 10⁶ -fold purification of the native message. Thus, purification of at least one order of magnitude, preferably two or three orders, and more preferably four or five orders of magnitude is favored in these techniques.

The term "nucleic acid molecule" describes a polymer of deoxyribonucleotides (DNA) or ribonucleotides (RNA). The nucleic acid molecule may be isolated from a natural source by cDNA cloning or subtractive hybridization or synthesized manually. The nucleic acid molecule may be synthesized manually by the triester synthetic method or by using an automated DNA synthesizer.

The term "cDNA cloning" refers to hybridizing a small nucleic acid molecule, a probe, to genomic cDNA that is bound to a membrane. The probe hybridizes (binds) to complementary sequences of cDNA.

The term "complementary" describes two nucleotides that can form multiple favorable interactions with one another. For example, adenine is complementary to thymidine as they can form two hydrogen bonds. Similarly, guanine and cytosine are complementary since they can form three hydrogen bonds. Thus a "complement" of a nucleic acid molecule is a molecule containing adenine instead of thymine, thymine instead of adenine, cytosine instead guanine, and guanine instead of cytosine. Because the complement contains a nucleic acid sequence that forms optimal interactions with the parent nucleic acid molecule, such a complement binds with high affinity to its parent molecule.

The term "hybridize" refers to a method of interacting a nucleic acid molecule (e.g., a nucleic acid probe) with a DNA or RNA molecule in solution or on a solid support, such as cellulose or nitrocellulose. If a nucleic acid probe binds to the DNA or RNA molecule with high affinity, it is said to "hybridize" to the DNA or RNA molecule. As mentioned above, the strength of the interaction between the probe and its target can be assessed by varying the stringency of the hybridization conditions. Various low or high stringency hybridization conditions may be used depending upon the specificity and selectivity desired. Stringency is controlled by varying salt or denaturant concentrations. Those skilled in the art will recognize how such conditions can be varied to vary specificity and selectivity. Under highly stringent hybridization conditions only highly complementary nucleic acid sequences hybridize. Preferably, such conditions prevent hybridization of nucleic acids having one or two mismatches out of 20 contiguous nucleotides.

cDNAs are molecules that may be reverse-transcribed from fragments of message RNA from a genomic source. These fragments form a cDNA library of nucleic acid molecules. cDNA libraries are constructed from natural sources such as mammalian blood, semen, or tissue.

The term "subtractive hybridization" refers to a method similar to cDNA cloning except that cDNA prepared from mRNA in unstimulated cells is added to mRNA in stimulated or different types of cells. cDNA/mRNA can then be precipitated to enrich the mRNA specific to the stimulation signal or different cell type.

The term "PTP20 polypeptide" refers to a polypeptide having an amino acid sequence preferably of at least 400 contiguous amino acids, more preferably of at least 450 contiguous amino acids, or most preferably of at least 453 contiguous amino acids set forth in SEQ ID NO:1, or is substantially similar to such a sequence, or have substantially similar functional activities such as phosphatase activity or growth or differentiation activity, and may be measured as described herein, e.g. at p. 26, lines 9-25, or p. 29, lines 14-28. A sequence that is substantially similar will preferably have at least 90% identity (more preferably at least 95% and most preferably 99-100% identity) to the amino acid sequence of SEQ ID NO:1. PTP20 polypeptides preferably have tyrosine phosphatase activity and fragments of the full length PTP20 sequence having such activity may be identified using techniques well known in the art, such as sequence comparisons and assays such as those described in the examples herein.

The term "identity" refers to a property of sequences that measures their similarity or relationship. Identity is measured by dividing the number of identical residues by the total number of residues and multiplying the product by 100. Thus, two copies of exactly the same sequence have 100% identity, but sequences that are less highly conserved and have deletions, additions, or replacements may have a lower degree of identity. Those skilled in the art will recognize that several computer programs are available for determining sequence identity. Such programs are generally able to achieve maximum alignment by ignoring deletions or additions that would otherwise alter the calculation of the percentage of identity between two sequences.

A preferred embodiment concerns nucleic acid molecules relating to PTP20 enriched, isolated, or purified from a mammalian source. These nucleic acid molecules can be isolated from, among other sources, blood, semen, or tissue.

The term "mammalian" refers to such organisms as, for example, mice, rats, rabbits, goats, monkeys, apes, and preferably humans. Although the PTP20 nucleic acid molecule of SEQ ID NO:1 is isolated from rat cells, current recombinant DNA techniques can readily elucidate a related nucleic acid molecule in human tissue.

Another preferred embodiment concerns an isolated nucleic acid molecule relating to PTP20 that encodes at least twelve contiguous amino acids of the amino acid sequence set forth in SEQ ID NO:7. Preferably at least 12, 15, 20, 25, 30, 35, 40, 50, 100, 200 or 300 contiguous amino acids of the PTP20 sequence are encoded. This preferred embodiment of the invention is achieved by applying routine recombinant DNA techniques known to those skilled in the art.

Another aspect of the invention features a nucleic acid probe that can detect nucleic acid molecules encoding a PTP20 polypeptide in a sample.

The term "nucleic acid probe" refers to a nucleic acid molecule that is complementary to and can bind a nucleic acid sequence encoding the amino acid sequence substantially similar to that set forth in SEQ ID NO:7.

By "substantially similar" it is meant a sequence that will preferably have at least 90% identity (more preferably at least 95% and most preferably 99-100%) to the sequence set forth in SEQ ID NO:7.

By "identity" is meant a property of sequences that measures their similarity or relationship. Identity is measured by dividing the number of identical residues in the two sequences by the total number of residues and multiplying the product by 100. Thus, two copies of exactly the same sequence have 100% identity, but sequences that are less highly conserved and have deletions, additions, or replacements have a lower degree of identity. Those skilled in the art will recognize that several computer programs are available for determining sequence identity.

The nucleic acid probe or its complement encodes any one of the amino acid molecules set forth in the invention. Thus the nucleic acid probe can encode at least 12, 75, 90, 105, 120, 150, 200, 250, 300 or 350 contiguous amino acids of the full-length sequence set forth in SEQ ID NO:7.

The nucleic acid probe can be labeled with a reporter molecule or molecules. The term "reporter molecule" refers to a molecule that is conjugated to the nucleic acid probe or is contained within the nucleic acid probe. The reporter molecule allows the detection of the probe by methods used in the art. Reporter molecules are chosen from, but not limited to, the group consisting of an enzyme, such as a peroxidase, a radioactive element, or an avidin molecule.

A nucleic acid probe, whether labeled or unlabeled, should hybridize to a complement in a sample.

The nucleic acid probe of the present invention can be a nucleic acid molecule encoding a conserved or unique region of amino acids of PTP20. These nucleic acid molecules are useful as hybridization probes to identify and clone additional polypeptides relating to PTP20.

The term "conserved nucleic acid regions", refers to regions present in two or more nucleic acid molecules encoding a PTP20 polypeptide, to which a particular nucleic acid sequence can hybridize under low stringency conditions. Examples of low stringency conditions suitable for screening nucleic acid molecules encoding PTP20 polypeptides are provided in Abe, et al. J. Biol. Chem., 12:13361 (1992) (hereby incorporated by reference herein in its entirety, including any drawings). Preferably, conserved regions differ by no more than 5 out of 20 nucleotides. As mentioned above, protein tyrosine phosphatases share conserved regions in their extracellular and catalytic domains.

The term "unique nucleic acid region" concerns a sequence present in a full length nucleic acid coding for a PTP20 polypeptide that is not present in a sequence coding for any other naturally occurring polypeptide. Such regions preferably comprise 30 or 45 contiguous nucleotides present in the full length nucleic acid sequence encoding a PTP20 polypeptide. In particular, a unique nucleic acid region is preferably of mammalian origin.

Methods for using the probes include detecting the presence or amount of PTP20 RNA in a sample by contacting the sample with a nucleic acid probe under conditions such that hybridization occurs. The nucleic acid duplex formed between the probe and a nucleic acid sequence coding for a PTP20 polypeptide may be used in the identification of the sequence of the nucleic acid detected (for example see, Nelson et al., in Nonisotopic DNA Probe Techniques, p. 275 Academic Press, San Diego (Kricka, ed., 1992) hereby incorporated by reference herein in its entirety, including any drawings). Kits for performing such methods may be constructed to include a container holding a nucleic acid probe.

In yet another aspect, the invention relates to a nucleic acid vector comprising a nucleic acid molecule encoding a PTP20 polypeptide and a promoter element effective to initiate transcription in a host cell.

The term "nucleic acid vector" relates to a single or double stranded circular nucleic acid molecule that can be transfected or transformed into cells and replicate independently or within the host cell genome. A circular double stranded nucleic acid molecule can be cut and thereby linearized upon treatment with restriction enzymes. An assortment of vectors, restriction enzymes, and the knowledge of the nucleotide sequences that the restriction enzymes operate upon are readily available to those skilled in the art. A nucleic acid molecule of the invention can be inserted into a vector by cutting the vector with restriction enzymes and ligating the two pieces together.

Many techniques are available to those skilled in the art to facilitate transformation or transfection of the expression construct into a prokaryotic or eukaryotic organism. The terms "transformation" and "transfection" refer to methods of inserting an expression construct into a cellular organism. These methods involve a variety of techniques, such as treating the cells with high concentrations of salt, an electric field, or detergent, to render the host cell outer membrane or wall permeable to nucleic acid molecules of interest.

The term "promoter element" describes a nucleotide sequence that is incorporated into a vector that, once inside an appropriate cell, can facilitate transcription factor and/or polymerase binding and subsequent transcription of portions of the vector DNA into mRNA. The promoter element precedes the 5' end of the PTP20 nucleic acid molecule such that the latter is transcribed into mRNA. Host cell machinery then translates mRNA into a polypeptide.

Those skilled in the art would recognize that a nucleic acid vector can contain many other nucleic acid elements besides the promoter element and the PTP20 nucleic acid molecule. These other nucleic acid elements include, but are not limited to, origins of replication, ribosomal binding sites, nucleic acid sequences encoding drug resistance enzymes or amino acid metabolic enzymes, and nucleic acid sequences encoding secretion signals, periplasm or peroxisome localization signals, or signals useful for polypeptide purification.

A nucleic acid vector can be useful for identifying natural binding partners of PTP20 polypeptides.

The term "natural binding partners" refers to polypeptides that bind to PTP20 and play a role in propagating a signal in a signal transduction process. The term "binding partner" also refers to a polypeptide that binds to PTP20 within a cellular environment with high affinity. High affinity represents an equilibrium binding constant on the order of 10⁻⁶ M. However, a natural binding partner can also transiently interact with a PTP20 polypeptide and chemically modify it. PTP20 natural binding partners are chosen from a group consisting of, but inot limited to, src homology 2 (SH2) (Sadowski, et al, Mol. Cell. Biol. 6:4396, 1986; Pawson and Schlessinger, Curr. Biol. 3:434, 1993) or 3 (SH3) domains (Mayer, et al, Nature 332:272, 1988; Pawson and Schlessinger, Curr. Biol. 3:434, 1993), other phosphoryl tyrosine binding domains, and receptor and non-receptor protein kinases or protein phosphatases.

Methods are readily available in the art for identifying natural binding partners of polypeptides of interest by screening cDNA libraries included in one nucleic acid vector with a nucleic acid molecule encoding the desired polypeptide in another expression construct. Vojtek et al., 1993, Cell 74:205-214. These techniques often utilize two halves of a transcription factor, one of which is fused to a polypeptide encoded by the cDNA library, and the other of which is fused to the polypeptide of interest. Interactions between a polypeptide encoded by the cDNA library and the polypeptide of interest are detected when their interaction concomitantly brings together the two halves of the transcription factor and activates a gene that reports the interaction. Any of the nucleic molecules encoding PTP20 polypeptide can be readily incorporated into a nucleic acid vector used in such a screening procedure by utilizing standard recombinant DNA techniques in the art.

Another aspect of the invention relates to a recombinant cell or tissue comprising a nucleic acid molecule encoding a PTP20 polypeptide.

The term "recombinant" refers to an organism that has a new combination of genes or nucleic acid molecules. A new combination of genes or nucleic acid molecules can be introduced to an organism using a wide array of nucleic acid manipulation techniques available to those skilled in the art. The recombinant cell can be a eukaryotic or a prQkaryotic organism.

The term "eukaryote" refers to an organism comprised of cells that contain a nucleus. Eukaryotes are differentiated from "prokaryotes" which do not house their genomic DNA inside a nucleus. Prokaryotes include unicellular organisms such as bacteria while eukaryotes are represented by yeast, invertebrates, and vertebrates.

The term "organism" relates to any living being comprised of at least one cell. An organism can be as simple as one eukaryotic cell or as complex as a mammal.

The recombinant cell can harbor a nucleic acid vector that is extragenomic. The term "extragenomic" refers to a nucleic acid vector which does not insert into the cell genome. Many nucleic acid vectors are designed with their own origins of replication allowing them to utilize the recombinant cell replication machinery to copy and propagate the vector nucleic acid sequence. These vectors are small enough that they are not likely to harbor nucleic acid sequences homologous to genomic sequences of the recombinant cell. Thus these vectors replicate independently of the host genome and do not recombine with or integrate into the genome.

A recombinant cell can harbor a portion of a nucleic acid vector in an intragenomic fashion. The term "intragenomic" defines a nucleic acid construct that is incorporated within the cell genome. Multiple nucleic acid vectors available to those skilled in the art contain nucleic acid sequences that are homologous to nucleic acid sequences in a particular organism's genomic DNA. These homologous sequences will result in recombination events that integrate portions of the vector into the genomic DNA. Those skilled in the art can control which nucleic acid sequences of the vector are integrated into the cell genome by flanking the portion to be incorporated into the genome with homologous sequences in the vector.

Yet another aspect of the invention features an isolated, enriched, or purified PTP20 polypeptide.

The term "isolated", in reference to a polypeptide, describes a polymer of amino acids conjugated to one another, including polypeptides that are isolated from a natural source or that are synthesized. In certain aspects longer polypeptides are preferred, such as those with most of the contiguous amino acids set forth in SEQ ID NO:7.

The isolated polypeptides of the present invention are unique in the sense that they are not found in a pure or separated state in nature. Use of the term "isolated" indicates that a naturally occurring sequence has been removed from its normal cellular environment. Thus, the sequence may be in a cell-free solution or placed in a different cellular environment. The term does not imply that the sequence is the only amino acid chain present, but that it is essentially free (about 90-95% pure at least) of non-amino acid material naturally associated with it.

The term "enriched", in reference to a polypeptide, defines a specific amino acid sequence constituting a significantly higher fraction (2-5 fold) of the total of amino acids present in the cells or solution of interest than in normal or diseased cells or in the cells from which the sequence was separated. A person skilled in the art can preferentially reduce the amount of other amino acid sequences present, or preferentially increase the amount of specific amino acid sequences of interest, or both. However, the term "enriched" does not imply that there are no other amino acid sequences present. Enriched simply means the relative amount of the sequence of interest has been significantly increased. The term "significant" indicates that the level of increase is useful to the person making such an increase. The term also means an increase relative to other amino acids of at least 2 fold, or more preferably at least 5 to 10 fold, or even more. The term also does not imply that there are no amino acid sequences from other sources. Other source amino acid sequences may, for example, comprise amino acid sequences from a host organism. "Enriched" is meant to cover only those situations in which a person has intervened to elevate the proportion of the desired amino acid sequence.

The term "purified", in reference to a polypeptide, does not require absolute purity (such as a homogeneous preparation); instead, it represents an indication that the amino acid sequence is relatively more pure than in a cellular environment. The concentration of the preferred amino acid sequence should be at least 2-5 fold greater (in terms of mg/ml) than its concentration in a cellular environment. Purification of at least one order of magnitude, preferably two or three orders, and more preferably four or five orders of magnitude is preferred. The substance is preferably free of contamination, as indicated by purity levels of 90%, 95%, or 99%.

A preferred embodiment relates to a PTP20 polypeptide that is a unique fragment of a PTP20 polypeptide.

The term "unique fragment" refers to a stretch of contiguous amino acids in PTP20 that is of a different sequence than another PTP. At least 12, 15, 20, 25, 30, 35, 40, 50, 100, 150, 200, 250, 300, or 350 contiguous amino acids of the full-length amino acid sequence of PTP20 are unique to PTP20.

The PTP20 polypeptide can be isolated, enriched, or purified from a prokaryotic or eukaryotic recombinant cell. A eukaryotic cell can arise from organisms including mammals and preferably humans. Multiple standard techniques are available to those skilled in the art to facilitate isolation, enrichment, or purification of a polypeptide from recombinant cells. These methods typically include lysing the recombinant cells and separating the polypeptide of interest from the rest of the cell polypeptides, nucleic acids, and fatty acid-based material using standard techniques known in the art.

Another aspect of the invention features an antibody that is monoclonal or polyclonal, or an antibody fragment having specific binding affinity to a PTP20 polypeptide.

Antibodies or antibody fragments are polypeptides which contain regions that can bind other polypeptides. The term "specific binding affinity" describes an antibody that binds to a PTP20 polypeptide with greater affinity than it binds to other polypeptides under specified conditions.

The term "polyclonal" refers to antibodies that are heterogenous populations of antibody molecules derived from the sera of animals immunized with an antigen or an antigenic functional derivative thereof. For the production of polyclonal antibodiwes, various host animals may be immunized by injection with the antigen. Various adjuvants may be used to increase the immunological reponse, depending on the host species.

"Monoclonal antibodies" are substantially homogenous populations of antibodies to a particular antigen. They may be obtained by any techniques which provides for the production of antibody molecules by continuous cell lines in culture. Monoclonal antibodies may be obtained by methods known to those skilled in the art. See, for example, Kohler, et al., Nature 256:495-497 (1975), and U.S. Pat. No. 4,376,110.

The term "antibody fragment" refers to a portion of an antibody, often the hypervariable region and portions of the surrounding heavy and light chains, that displays specific binding affinity for a particular molecule. A hypervariable region is a portion of an antibody that physically binds to the polypeptide target.

Antibodies or antibody fragments having specific binding affinity to a PTP20 polypeptide may be used in methods for detecting the presence and/or amount of a PTP20 polypeptide in a sample by probing the sample with the antibody under conditions suitable for PTP20-antibody immunocomplex formation and detecting the presence and/or amount of the antibody conjugated to the PTP20 polypeptide. Diagnostic kits for performing such methods may be constructed to include antibodies or antibody fragments specific for PTP20 as well as a conjugate of a binding partner of the antibodies or the antibodies themselves.

An antibody or antibody fragment with specific binding affinity to a PTP20 polypeptide can be isolated, enriched, or purified from a prokaryotic or eukaryotic organism. Routine methods known to those skilled in the art enable production of antibodies or antibody fragments, in both prokaryotic and eukaryotic organisms. Purification, enrichment, and isolation of antibodies, which are polypeptide molecules, are described above.

Another aspect of the invention features a hybridoma which produces an antibody having specific binding affinity to a PTP20 polypeptide. A "hybridoma" is an immortalized cell line which is capable of secreting an antibody, for example an antibody with specific binding affinity to PTP20.

Another aspect of the invention features an isolated, enriched, or purified nucleic acid molecule comprising a nucleotide sequence that: (a) encodes a polypeptide having the full length amino acid sequence set forth SEQ ID NO:7; (b) is the complement of the nucleotide sequence of (a); (c) hybridizes under highly stringent conditions to the nucleotide molecule of (b) and encodes a naturally occurring PTP20 protein; (d) encodes a PTP20 polypeptide having the full length amino acid sequence of the sequence set forth in SEQ ID NO:7 except that it lacks one or more of the following segments of amino acid residues 1-58, 59-294, or 295-453. (e) is the complement of the nucleotide sequence of (d); (f) encodes a polypeptide having the amino acid sequence set forth in SEQ ID NO:7 from amino acid residues 1-58, 59-294, or 295-453; (g) is the complement of the nucleotide sequence of (f); or (h) encodes a polypeptide having the full length amino acid sequence set forth in SEQ ID NO:7 except that it lacks one or more of the domains selected from the group consisting of a N-terminal domain, a catalytic domain, and a C-terminal domain, or (i) is the complement of the nucleotide sequence of (h).

The term "N-terminal domain" refers to a portion of the full length PTP20 amino acid sequence spanning from the amino terminus to the start of the catalytic domain. The N-terminal domain spans amino acid residues 1-58 of the sequence set forth in SEQ ID NO:7.

The term "catalytic domain" refers to a portion of the PTP20 amino acid molecule that does not contain the N-terminal domain and has catalytic activity. The catalytic domain spans amino acid residues 59-294 of the sequence set forth in SEQ ID NO:7.

The term "C-terminal domain" refers to a portion of PTP20 that begins at the end of the catalytic domain and ends at the carboxy terminal amino acid, which is the last amino acid encoded before the stop codon in the nucleic acid sequence. The C-terminal domain spans amino acid residues 295-453 of the sequence set forth in SEQ ID NO:7.

Domains are regions of polypeptides which have particular functions. For instance, N-terminal or C-terminal domains of signal transduction proteins can serve functions including, but not limited to, binding molecules that localize the signal transduction molecule to different regions of the cell or binding other signaling molecules directly responsible for propagating a particular cellular signal. Some domains can be expressed separately from the rest of the protein and function by themselves, while others must remain part of the intact protein to retain function. The latter are termed functional regions of proteins and also relate to domains.

Functional regions of PTP20 may be identified by aligning the amino acid sequence of PTP20 with amino acid sequences of other polypeptides with known functional regions. If regions of PTP20 share high amino acid identity with the amino acid sequences of known functional regions, then PTP20 can be determined to contain these functional regions by those skilled in the art. The functional regions can be determined, for example, by using computer programs and sequence information available to those skilled in the art.

Other functional regions of signal transduction molecules that may exist in the PTP20 amino acid sequence include, but are not limited to, proline-rich regions or phosphoryl tyrosine regions. These regions can interact with natural binding partners such as SH2 or SH3 domains of other signal transduction molecules.

In yet another aspect, the invention includes a nucleic acid vector containing a nucleic acid molecule described above.

Another aspect of the invention relates to a recombinant cell or tissue that contains a nucleic acid molecule described above.

In yet another aspect, the invention features a method of identifying compounds capable of modulating PTP20 catalytic activity. This method consists of the following steps: (a) adding a compound to cells containing a PTP20 polypeptide; and (b) detecting a change in the catalytic activity of the PTP20 polypeptide.

The term "modulating" refers to the ability of a compound to alter PTP20 catalytic activity. A modulator preferably activates PTP20 catalytic activity, more preferably activates or inhibits PTP20 catalytic activity depending on the concentration of the compound exposed to PTP20, or most preferably inhibits PTP20 catalytic activity.

The term "compound" includes small organic molecules including, but not limited to, oxindolinones, quinazolines, tyrphostins, quinoxalines, and extracts from natural sources.

The term "a change in catalytic activity", in the context of the invention, defines a method of observing a change in PTP20 catalytic activity in response to adding a compound to cells. The catalytic activity of a PTP20 polypeptide can be detected, for example, by measuring the amount of a substrate, such as p-nitrophenylphosphate, converted to a product, such as p-nitrophenol, with respect to time. Addition of a compound to cells expressing a PTP20 polypeptide may either enhance (activate) or lower (inhibit) the catalytic activity. If a compound lowers PTP20 catalytic activity, the compound is assumed to bind to a PTP20 polypeptide and block the ability of PTP20 to bind and/or turn over a substrate. If a compound enhances PTP20 catalytic activity, the compound is assumed to bind to a PTP20 polypeptide and facilitate the ability of PTP20 to bind and/or turn over a substrate.

The method can utilize any of the molecules disclosed in the invention. These molecules include nucleic acid molecules encoding PTP20 polypeptides, nucleic acid vectors, recombinant cells, polypeptides, or antibodies of the invention.

Another aspect of the invention relates to a method of identifying compounds useful for diagnosis or treatment of an abnormal condition in an organism. The abnormal condition can be associated with an aberration in a signal transduction pathway characterized by an interaction between a PTP20 polypeptide and a natural binding partner. The method comprises the following steps: (a) adding a compound to cells; and (b) detecting whether the compound promotes or disrupts an interaction between a PTP20 polypeptide and a natural binding partner.

The term "abnormal condition" refers to a function in an organism's cells or tissue that deviate from a normal function in the cells or tissue of that organism. In the context of this aspect of the invention, abnormal conditions can be associated with, for example, cell proliferation. Cell proliferative disorders include, but are not limited to, cancers such as fibrotic and mesangial disorders, abnormal angiogenesis and vasculogenesis, slow wound healing rates, psoriasis, diabetes mellitus, and inflammation. Abnormal conditions can also be associated with cell differentiation. Cell differentiation disorders include, but are not limited to, neurodegenerative disorders, slow wound healing rates, and grafting tissue grafting techniques.

The abnormal condition can be diagnosed when the organism's cells exist within the organism or outside of the organism. Cells existing outside the organism can be maintained or grown in cell culture dishes. For cells harbored within the organism, many techniques exist in the art to administer compounds, including (but not limited to) oral, parenteral, dermal, and injection applications. For cells outside of the patient, multiple techniques exist in the art to administer the compounds, including (but not limited to) cell microinjection techniques, transformation techniques, and carrier techniques.

The term "signal transduction pathway" refers to the molecules that propagate an extracellular signal through the cell membrane to become an intracellular signal. This signal can then stimulate a cellular response. The polypeptide molecules involved in signal transduction processes are typically receptor and non-receptor protein kinases, receptor and non-receptor protein phosphatases, and transcription factors.

The term "aberration", in conjunction with a signal transduction process, refers to a polypeptide, for example PTP20, that is over- or under-expressed in an organism, mutated such that its catalytic activity is lower or higher than wild-type polypeptide, mutated such that it can no longer interact with a binding partner, is no longer modified by another protein kinase or protein phosphatase, or no longer interacts with a binding partner.

The term "interaction" defines the complex formed between a PTP20 polypeptide and a natural binding partner. Compounds can bind to either the PTP20 polypeptide or the natural binding partner and disrupt the interaction between the two molecules.

The term "promote or disrupt the abnormal interaction" refers to a method that can be accomplished by administering a compound to cells or tissues in an organism. A compound can promote an interaction between PTP20 and natural binding partners by forming favorable interactions with multiple atoms at the complex interface. Alternatively, a compound can inhibit an interaction between a protein kinase and natural binding partners by compromising favorable interactions formed between atoms at the complex interface.

Methods of detecting the ability of a compound to disrupt or enhance an interaction between PTP20 and a natural binding partner exist in the art. These methods include, but are not limited to, determining the effect of the compound upon the catalytic activity of a PTP20 polypeptide, the phosphorylation state of the PTP20 polypeptide or a natural binding partner, the ability of PTP20 to bind a natural binding partner, or a difference in a cell morphology. Differences in cell morphology include growth rates and differentiation rates of cells. These phenomena are simply measured by methods in the art. These methods typically involve observing the number of cells or the appearance of cells under a microscope with respect to time (for example, days).

The method can be performed in vitro as well as in vivo. In vivo applications include introducing a group of cells to an organism and then determining the effect of a compound administered to the organism on the state of the organism as well as the introduced cells. The art contains multiple methods of introducing a group of cells to an organism as well as methods of administering a compounds to an organism. The organism is preferably an animal such as a frog, mouse, rat, rabbit, monkey, or ape, and also a human.

Another aspect of the invention relates to a method of diagnosing an abnormal condition associated with cell proliferation or cell differentiation in an organism. The abnormal condition can be associated with an aberration in a signal transduction pathway characterized by an interaction between a PTP20 polypeptide and a natural binding partner. The method comprises the step of detecting an abnormal interaction.

The term "detecting an abnormal interaction" defines a method of identifying a PTP20 molecule with an aberration in its activity. Detection is accomplished by using an antibody or antibody fragment of the invention, a nucleic acid probe of the invention, or a compound of the invention.

Techniques used in the art that incorporate this method include in vitro, in vivo, and in situ hybridization techniques. These techniques utilize nucleic acid probes of the invention.

A preferred embodiment of the invention is the diagnosis method relating to an organism that is a mammal.

The summary of the invention described above is not limiting and other features and advantages of the invention will be apparent from the following detailed description of the invention, and from the claims.

DETAILED DESCRIPTION OF THE INVENTION

The present invention is based in part upon the isolation and characterization of nucleic acid molecules encoding a novel protein tyrosine phosphatase designated PTP20. The invention relates to nucleic acid molecules encoding portions of the PTP20 polypeptide, nucleic acid molecules encoding at least one PTP20 functional portion, nucleic acid vectors harboring such nucleic acid molecules, recombinant cells containing such nucleic acid vectors, purified polypeptides encoded by such nucleic acid molecules, antibodies to such polypeptides, and methods of identifying compounds that bind PTP20 or modulate its interactions with natural binding partners. Also disclosed are methods for diagnosing abnormal conditions in an organism with PTP20 related molecules or compounds.

The open reading frame of the full-length PTP20 nucleic acid molecule encodes a protein of 453 amino acids with a predicted molecular weight of approximately 50 kDa. Hydropathy analysis (see Kyte and Doolittle, 1982, J. Mol. Bio. 157:105-132) indicates that PTP20 contains no hydrophobic segments appropriate for signal peptide or transmembrane domains and therefore PTP20 is most likely an intracellular protein. The transcripts corresponding to nearly the same size of the full length cDNA are detected in several tissues including brain, liver, lung, spleen, skeletal muscle, kidney, and testis. The tissues may come from rats or other mammals.

The catalytic domain is located near the predicted amino terminus between amino acids 58 and 283. The catalytic domain of PTP20 may be homologous to the PTP-PEST-family phosphatases, such as human and rat PTP-PESTs and PEP-PTP. Takekawa et al., 1992, Biochem. Biophys. Res. Commun. 189:1223-1230; Yang et al., 1993, J. Biol. Chem. 268:6622-6628; Matthews et al., 1992, Mol. Cell. Biol. 12:2396-2405. Proline, glutamate, serine, and threonine residues (PEST) are enriched in the PEST-motif sequence, which is not arranged in any specified consensus sequence. Rechsteiner and Rogers, 1996, TIBS 21:267-271. PTP20 may have a PEST sequence between amino acids 285 and 453, suggesting that PTP20 may be a member of the PTP-PEST family.

Experimental results implicate PTP20 as an essential agent involved in a growth factor stimulated cellular differentiation signal transduction pathway. Although most cells have already differentiated in adults, activators of PTP20 might cause differentiation instead of proliferation of cellular tumors and therefore act as anti-cancer therapeutics. In addition, inhibitors of PTP20 might be useful for treating neural injuries by delaying the differentiation of transplanted neuronal stem cells until they are firmly grafted.

Various other features and aspects of the invention are: nucleic acid molecules encoding a PTP20 polypeptide; nucleic acid probes for the detection of PTP20; a probe-based method and kit for detecting PTP20 messages in other organisms; DNA constructs comprising a PTP20 nucleic acid molecule and cells containing these constructs; purified PTP20 polypeptides; PTP20 antibodies and hybridomas; antibody-based methods and kits for detecting PTP20; identification of agents; isolation of compounds which interact with a PTP20 polypeptide; compositions of compounds that interact with PTP20 and PTP20 molecules; pharmaceutical formulations and modes of administration; derivatives of complexes; antibodies to complexes; disruption of PTP20 protein complexes; purification and production of complexes; transgenic animals containing PTP20 nucleic acid constructs; antisense and ribozyme approaches, gene therapy; and evaluation of disorders. One skilled in the art appreciates that any modifications made to a complex can be manifested in a modification of any of the molecules in that complex. Thus, the invention includes any modifications to nucleic acid molecules, polypeptides, antibodies, or compounds in a complex. All of these aspects and features are explained in detail with respect to PYK-2 in PCT publication WO 96/18738, which is incorporated herein by reference in its entirety, including any drawings. Those skilled in the art will readily appreciate that such descriptions can be easily adapted to PTP20 as well, and are equally applicable to the present invention.

EXAMPLES

The examples below are non-limiting and are merely representative of various aspects and features of the present invention. The examples below demonstrate the isolation, and characterization of the novel protein PTP20.

Example 1 PCR Amplification and Cloning

In order to identify novel PTP genes involved in differentiation of PC12 cells, which are available from ATCC (CRL1721), reverse transcriptase-PCR amplification was performed using poly(A)⁺ RNA from both undifferentiated and differentiated PC12 cells as a template and degenerate primers that were deduced from highly conserved amino acid sequences within the PTP catalytic domain. This reaction gave specific products of 350-400 bp. These fragments were isolated, cloned and sequenced. PCR clone number 20 (PTP20) exhibited sequence similarities but was not identical to any previously known PTPs. Using the PTP20 PCR-generated cDNA fragment as a probe, a full length cDNA clone was isolated from a PC12 cDNA library and characterized by sequence analysis.

Degenerate oligonucleotide sense and antisense primers were based on consensus sequences for two highly conserved amino acid stretches within the catalytic domains of PTPs: FWXMXW (SEQ ID NO:2) and HCSAG(S/I/V)G (SEQ ID NO:3). Random-primed cDNA (up to 50 ng) from PC12 cell RNA was used as a template for PCR. Both sense and antisense primers were added to a 100 ml reaction mixture containing 20 mM Tris-HCl (pH8.4), 50 mM KCl, 2.5 mM MgCl₂, 0.01% BSA, ;ill four dNTPs (each at 200 mM), 1 unit of Taq polymerase (Boehringer Mannheim) and template cDNA. Thirty-five cycles were carried out on a thermal cycler; each cycle involved incubation at 94° C. for 1 min, at 42° C. for 1 min and 72° C. for 1 min. The PCR products were separated on a 1.5% agarose gel. Fragments of 350-400 bp were excised, subcloned into pBluescript KS(+) and sequenced.

The PTP20 PCR fragment was isolated, radioactively labeled by random priming, and used to screen 1×10⁶ plaques from a PC12 cDNA library which had been made using a pool of poly(A)⁺ RNA from both undifferentiated and differentiated PC12 cells, and a ZAPII synthesis kit (Stratagene). Hybridization was performed in a solution containing 50% (v/v) formamide, 5×SSC, 5×Denhardt solution, 0.05M sodium phosphate, 1 mM NaH2PO4, 1 mM Na4P2O7, 0.1 mM ATP, 5 mg salmon sperm DNA at 42° C. for 20 h. Washing was repeated three times with 2×SSC/0.1% SDS for 20 min at 42° C. Six positive clones were obtained and plaque-purified by secondary screening. Positive clones were rescued according to the manufacturer's instruction and sequenced in both directions.

The 2226 bp cDNA clone of PTP20 contained an open reading frame of 1359 bp preceded by 27 base pairs of 5'-non-coding region and 840 base pairs of 3'-non-coding region. The 3'-non-coding region contained the polyadenylation signal sequence AATAAA. The open reading frame encoded a protein of 453 amino acids with a predicted molecular weight of approximately 50 kDa.

Example 2 PCR Amplification and Cloning Using Human Cells

PTP genes involved in differentiation of human cells could be identified by reverse transcriptase-PCR amplification performed using poly(A)⁺ RNA from both undifferentiated and differentiated human cells as a template and degenerate primers that are deduced from highly conserved amino acid sequences within the PTP catalytic domain. The human cells could be isolated from various different tissues. This reaction gives specific products of different lengths. These fragments are isolated, cloned and sequenced. A PCR clone (PTP20) exhibits sequence similarities but is not identical to any previously known PTPs. Using the PTP20 PCR-generated cDNA fragment as a probe, a full length cDNA clone is isolated from a PC12 cDNA library and characterized by sequence analysis.

Degenerate oligonucleotide sense and antisense primers are based on consensus sequences for two highly conserved amino acid stretches within the catalytic domains of PTPs: FWXMXW and HCSAG(S/I/V)G. Random-primed cDNA (up to 50 ng) from PC12 cell RNA are used as a template for PCR. Both sense and antisense primers are added to a 100 ml reaction mixture containing 20 mM Tris-HCl (pH8.4), 50 mM KCl, 2.5 mM MgCl₂, 0.01% BSA, all four dNTPs (each at 200 mM), 1 unit of Taq polymerase (Boehringer Mannheim) and template cDNA. Thirty-five cycles are carried out on a thermal cycler; each cycle involves incubation at 94° C. for 1 min, at 42° C. for 1 min and 72° C. for 1 min. The PCR products are separated on a 1.5% agarose gel. Fragments of 350-400 bp are excised, subcloned into pBluescript KS(+) and sequenced.

The PTP20 PCR fragment is isolated, radioactively labeled by random priming, and used to screen 1×10⁶ plaques from a PC12 cDNA library which has been made using a pool of poly(A)⁺ RNA from both undifferentiated and differentiated PC12 cells, and a ZAPII synthesis kit (Stratagene). Hybridization is performed in a solution containing 50% (v/v) formamide, 5×SSC, 5×Denhardt solution, 0.05M sodium phosphate, 1 mM NaH2PO4, 1 mM Na4P2O7, 0.1 mM ATP, 5 mg salmon sperm DNA at 42° C. for 20 h. Washing is repeated three times with 2×SSC/0.1% SDS for 20 min at 42° C. Six positive clones are obtained and plaque-purified by secondary screening. Positive clones are rescued according to the manufacturer's instruction and sequenced in both directions.

Example 3 PTP20 Message in Response to NGF

To elucidate the role of PTP20 in the differentiation process of PC12 cells, Northern blot analysis was used to examine the expression pattern of PTP20 mRNA in PC12 cells treated with NGF for three or six days. Untreated PC12 cells exhibited a 2.3 kb PTP20 mRNA transcript. Following 3 days of NGF treatment, a 1.5-fold increase in the amount of transcript was observed. Another 3 days of NGF treatment caused a 2.4-fold increase as compared to untreated cells. In addition to the predominant 2.3 kb transcript, a faint band with 1.5 kb in size was also detected which also increased in abundance as NGF treatment continued. The expression pattern of PTP20 mRNA suggested that PTP20 might play a role during NGF-induced PC12 differentiation.

Northern blot analyses were carried out according to well established protocols. RNA was extracted from PC12 cells by acid guanidinium isothiocyanate-phenol-chloroform method. Chomczynski and Sacci, 1987, Anal. Biochem. 162:156-159. Poly(A)⁺ RNA was isolated with oligo(dT) Sepharose (Stratagene) column chromatography according to the manufacturer's instruction. Two micrograms of poly(A)⁺ RNA was electrophoresed in a formaldehyde/1.0% agarose gel, blotted to a nitrocellulose membrane filter, and hybridizes to ³² P-labeled full length PTP20 cDNA as a probe.

Example 4 PTP20 is Implicated in a Cellular Differentiation Signal Transduction Pathway

To further elucidate the function of PTP20 in cellular differentiation, PC12 cells were stably transfected with the PTP20 cDNA mammalian expression construct.

The following method was utilized to stably transfect cells with a gene encoding PTP20. Rat pheochromocytoma cells (PC12) were cultured in Dulbecco's modified Eagle's medium (DMEM) containing high glucose (4.5 g/liter) supplemented with 10% heat-inactivated horse serum (HS) and fetal calf serum (FCS). 5×10⁵ cells per 60 mm dish were incubated overnight in 4 ml of growth medium. The following day, the dish was washed once with serum-free medium and then incubated with a Lipofectin (5 ml)-DNA (2 my) mixture for 6 h. After 48 h, selection started in growth medium containing 500 mg/ml G418 (GIBCO BRL). Following 5 weeks of selection, discrete colonies were subcloned and expanded.

From seven stably transfected clonal cell lines, three independent clones showed high levels of PTP20 expression as assessed by Western blotting. In parental PC12 cells, endogenous PTP20 protein was beneath detection with the antibody. These three independent clones appeared morphologically similar to parental PC12 cells. However, following NGF treatment (50 ng/ml), all three clones showed accelerated neurite outgrowth, with 20 to 40% of the cells expressing neurites of more than two cell bodies in length at day 1 and more than 70% of the cells expressing such neurites at day 3. In contrast, the parental PC12 cells showed less than 5% of the cells with neurites of two cell bodies in length at day 1 and 47% at day 3. At day 4 following NGF treatment, more than 70% of both parental PC12 cells and PTP-PC12 cells expressed neurite outgrowth, however, the neurite length and the abundance of neurites in PTP-PC12 cells appeared longer and larger than those of parental PC12 cells. In addition, PTP-PC12 cells responded to lower concentrations of NGF than did parental PC12 cells. This suggests that NGF-induced differentiation was promoted by the expression of PTP20.

Example 5 Production of Functionally Active PTP20

To confirm that PTP20 encodes a functionally active PTPs, Rat-1 fibroblast cells were transiently transfected with mammalian expression constructs encoding either PTP20 or a Cys to Ser mutant of PTP20. Cell lysates were prepared and protein concentrations were determined. The expression level of both wild type and catalytically inactive mutant PTP20 was confirmed by Western blotting with anti-PTP-PEST antibodies. Cross-reactivity with on-specific proteins was not detected as evidenced by lack of a signal in control reactions. Nearly equivalent amounts of expressed protein were detected. The size of the detected protein was 50 kDa which is consistent with the predicted molecular weight of PTP20. For protein tyrosine phosphatase activity, equivalent amounts of protein from the transfected Rat-l cell lysates were tested using p-nitrophenylphosphate as a substrate. Lysates from transfected cells exhibited an approximately 2.5-fold higher PTP activity over those from control cells, whereas only basal levels of PTPase activity were detected in lysates from cells transfected with a construct encoding a catalytically inactive mutant of PTP20. These results indicate that full length PTP20 cDNA encodes a functionally active PTP.

The insert of PTP20 was excised with EcoRI digestion and integrated into an expression vector, pcDNA3 (Invitrogen) which had been digested with the same restriction enzyme. The direction of the insert in the plasmid was confirmed by restriction mapping. Rat-i cells were transfected with the plasmid (2 mg/1×10⁶ cells) by using Lipofectin (GIBCO BRL). After 48 h of culturing, the cells were washed with PBS and then lysed with lysis buffer [50 mM HEPES, pH 7.5, containing 150 mM NaCl, 1 mM EDTA, 10% (v/v) glycerol, 1% (v/v) Triton X-100, 1 mM phenylmethylsulfonyl fluoride, 1 mM sodium orthovanadate, mg/ml aprotinin]. Protein concentrations of cell lysates were measured with a protein assay kit (Bio-Rad) using bovine serum albumin as a standard. Equivalent amounts of protein were used for Western blot analyses and phosphatase activity assay.

The PTP20 mutant containing a cysteine to serine alteration at position 229 was generated using an oligonucleotide primer, CTCTGTGTCCACAGCAGTGCTGGCTGT (SEQ ID NO:4). Kunkel, 1985, Proc. Natl. Acad. Sci. USA 82:488-492. The mutation was confirmed by DNA sequencing.

For Western blot analysis, cells were first lysed in lysis buffer. To assess PTP20 expression, equivalent amounts of protein in the cell lysates were separated by 10% SDS-PAGE and electrophoretically transferred to nitrocellulose membranes. The membranes were first incubated with rabbit anti-PTP-PEST antibodies, and then a peroxidase-coupled goat anti-rabbit secondary antibody (BioRad) was added, followed by an enhanced chemiluminescence (ECL) substrate (Amersham) reaction. The substrate reaction was detected on a X-ray film (Amersham). The anti-PTP-PEST antibody was raised against the C-terminal 56 amino acids of human PTP-PEST (Takekawa et al., 1992, Biochem. Biophys. Res. Commun. 189:1223-1230) which was expressed as a GST fusion protein.

Phosphatase activity of PTP20 was assayed in a 200 ml solution containing 25 mM MES

(2-[N-morpholino]ethanesulfonic acid), pH 5.5, 1.6 mM DTT, mM p-nitrophenylphosphate as a substrate and 50 mg protein of cell lysate at 37° C. for 30 min. The reaction was stopped by the addition of 100 ml of 1N NaOH, and the absorbance was measured at 405 nm.

Example 6 PTP20 Specifically Dephosphorylates EGF Receptor, Thereby Triggering NGF-induced Differentiation of PC12 Cells

Materials and Methods

Construction of plasmids

pUHD15-1neo contains the tetracycline-controlled transactivator (tTA) coding sequence driven by the human cytomega virus (CMV) promoter/enhancer (Gossen, M. et al., (1992) Proc. Natl. Acad. Sci. USA, 89, 5547-5551). pUHD10-3 contains the PhCMV*-1 followed by a multiple cloning sequence as described in the art (Gossen et al. 1992). In this study, 6 histidine residues were tagged at the N-terminus of PTP20 wild-type (wt) or catalytically inactive mutant (C/S mutant) by PCR amplification with primers as follows; sense primer: 5'-ATG AAT TCG CCA CCA TGC ATC ACC ATC ACC ATC ACA GTC GCC AAT CGG ACC TAG TGA GG-3' (SEQ ID NO:5) and antisense primer: 5'-ATC TCG AGT TAC ACC CGT GTC CAC TCC GCT GGA GGA TC-3' (SEQ ID NO:6). After PCR amplification, the products were subcloned EcoRl and XbaI sites into pBluescript KS(+) (Stratagene) and sequenced. Then, the products were ligated into pUHD10-3 via the same sites. By transient expression in 293 cells, histidine-tagged PTP20 exhibited indistinguishable PTPase activity toward p-nitrophenyl phosphate from that of native PTP20.

Establishment of cell lines

PC12 cells which stably express tTA were produced by transfecting 2 μg of pUHD15-1neo with 5 μl of Lipofectin (Life Technologies, Inc.) according to the manufacturer's instruction. Following 4 weeks of selection, cells resistant to 500 μg/ml G418 were cloned and tested for tTA expression by an assay for the transient expression of the galactosidase gene in pUHD10-3 (pUHD-gal). In this assay, 2 μg of pUHD-gal was transfected into 5×10⁵ cells, which had been cultured overnight in the presence or absence of tetracycline (2 μg/ml), using 5 μl of Lipofectin for 6 h. Then the cells were split into two 6-cm dishes in the presence or absence of tetracycline (2 μg/ml) and then cultured for another 48 h. P-galactosidase assays were done using o-nitrophenyl-β-D-galactopyranoside as a substrate. Clone 15-1#5 which showed the highest induction of β-galactosidase activity was selected and used for further transfection of PTP20. The stable transfection of pUHD-PTP20 wt or pUHD-PTP20 C/S into tTA-expressing clone 15-1#5 was performed with Lipofectin as described above. Two micrograms of pUHD-PTP20 wt or pUHD-PTP20 C/S was transfected into host cells with 0.2 μg of pSV2-hph which contains the hygromycin-resistant gene driven by rous sarcoma virus promoter. Cells were selected with 200 μg/ml hygromycin B (Boehringer Mannheim). Colonies were cloned, expanded and further analysed.

Immunoblotting

Expression of PTP20 was assessed as described previously using anti-PTP-PEST antibody (Aoki et al., (1996) J. Biol. Chem., 271, 294422-29426). To perform co-precipitation experiments for EGF receptor or histidine-tagged PTP20, precleared lysate was incubated with Lentil Sepharose or Ni-NTA agarose (QIAGEN), respectively. Immunoprecipitation of NGF receptor and SHC was carried out essentially as described in the art (Obermeier et al., (1993) J. Biol. Chem., 268, 22963-22966). After extensive washing, the precipitates were separated by SDS-PAGE followed by blotting onto nitrocellulose membrane (Amersham). The blots were incubated with 4G10 (UBI) followed by incubation with peroxidase-conjugated goat anti-mouse immunoglobulin. The ECL system (Amersham) was used to visualize proteins recognized by the antibodies. After stripping antibodies off the blots, the procedures were repeated with other antibodies.

Neurite Outgrowth

Prior to experiments, cells were cultured in DMEM containing 10% horse serum (HS) and 5% fetal calf serum (FCS) for 48 h in the presence or absence of tetracycline for the expression of PTP20. Then, cells were inoculated into 6-well plates at a density of 0.5-1.0×10⁴ cells per well, cultured in the same medium for 1 day, and the cell culture medium was changed to DMEM containing 1% HS, 0.5% FCS and human ONGF (Boehringer Mannheim) in the presence or absence of tetracycline as indicated (Day 0). At indicated days, 100 to 200 cells were counted under microscopy and scored as the expression of neurite longer than one cell body.

[³ H]thymidine incorporation assay

Prior to experiments, cells were cultured in DMEM containing 10% HS and 5% FCS for 48 h in the presence or absence of tetracycline. Then, cells were plated at 1×10⁵ cells per ml in 0.5 ml of DMEM containing 0.5% HS and 0.25% FCS in 24-well plates. After 2 days, the medium was replaced with 0.5 ml of DMEM containing 10% HS and 5% FCS, EGF (50 ng/ml), or NGF (50 ng/ml) and incubated for 18 h. Then, 0.8 μCi of [³ H]thymidine (Amersham) per ml was added, and the incubation was prolonged for 4 h. Cells were then washed with ice-cold 5% trichloroacetic acid. Insoluble material was harvested and quantified in the presence of liquid scintillator.

Proliferation Assay

Prior to experiments, cells were cultured in DMEM containing 10% HS and 5% FCS for 48 h in the presence or absence of tetracycline. Then, 5×10³ cells per well of 96 well plate were inoculated and cultured in the same medium in the presence or absence of tetracycline. At indicated days, 10 μl of cell proliferation reagent WST-1 solution (Boehringer Mannheim) was directly added to the culture medium and further incubated for 4 h at 37° C. The absorbance at 420 nm was determined with ELISA reader.

MAP kinase assay

Soluble cellular lysates prepared in lysis buffer plus phosphatase and protease inhibitors were adjusted to equal protein concentrations and incubated with anti-ERK2 antiserum (Santa Cruz) and Protein A-Sepharose (Pharmacia) for 2 h at 4° C. MAP kinase reactions were performed as described in the art (Traverse et al., (1992) Biochem. J., 288, 351-355) with slight modifications using myelin basic protein as a substrate, after which the samples were separated by SDS-polyacrylamide gel electrophoresis on a 15% polyacrylamide gel. Kinase activity was determined by analysing the dried gel with a PhosphoImager (Fuji Film).

Results

Selection of PTP20-Expressing Clones Previously it had been observed that PC12 cell clones over-expressing PTP20 grew more slowly than parental PC12 cells, suggesting that the PTPase interfered with cell proliferation signals. To clarify this, tetracycline-controlled system (Gossen et al., 1992) was employed under the conditions that would express the PTP20 gene in PC12 cells. After establishment of a tetracycline-controlled transactivator (tTA) cell line, which exhibited low background expression of a β-galactosidase reporter gene and approximately 30 fold induction after tetracycline (Tet) removal, lines that overexpressed PTP20 upon release of Tet expression were generated.

Following transfection with either pUHD-PTP20 wt or pUHD-PTP20 C/S plus hygromycin resistant plasmid into the 15-1#5 clone, 18 clones each resistant to hygromycin B were isolated and analysed by Western blotting using anti-PTP-PEST antibodies. Three clones out of 18 displayed high expression of PTP20 wt and another 3 displayed high expression of C/S mutant when the clones were cultured in the absence of tetracycline, whereas nearly no PTP20 was detected in the presence of tetracycline. On the same blots, nearly the same amount of PTP1B was detected, indicating that the difference in the amount of PTP20 was not due to misleading the samples or other experimental errors. In control cells which had been transfected with empty pUHD10-3 and pSV-hph, the expression of PTP20 wt and C/S mutant was undetectable with antibody used. Examination of various induction times in Tet-free cell culture medium indicated that for maximal expression at least 48 h incubation was needed. Wild-type and C/S mutant clones showed normal morphology indistinguishable from that of native PC12 cells.

The tetracycline-controlled gene expression system allows quantitatively controlled expression of a target gene by the addition or removal of tetracycline (Yin et al., (1995) Cancer Res., 55, 4922-4928). In our PC12 system, the expression of PTP20 wt and C/S mutant was well controlled in a tetracycline concentration-dependent manner whereas the amount of endogenous PTPlB on the same blot was unaffected. Densitometeric analysis showed that expression of PTP20 wt and C/S mutant was reduced by 72% and 82%, respectively, by increasing the amount of tetracycline from 0 to 0.1 μg/ml.

Effect of PTP20 Wild-Type and C/S Mutant Overexpression on Differentiation and Proliferation of PC12 Cells

It has been shown that overexpression of PTP20 resulted in accelerated NGF-induced differentiation of PC12 cells (Aoki et al., 1996). This was confirmed with the clone 4 of PTP20 wt. Overexpression of PTP20 wt by removal of tetracycline from the cell culture medium caused rapid differentiation, in which more than 80% of cells expressed neurites 2 days after NGF treatment. Furthermore, the addition of tetracycline (0.1 μg/ml) reduced the efect to about 50%. This level was still much higher than that in the cells exposed to tetracycline with expression levels that were not detectable with the Ab used.

In contrast, induction of PTP20 C/S overexpression by removal of tetracycline suppressed the NGF-induced differentiation of PC12 cells. Only 30% of cells expressed neurites even after 4 days of NGF treatment. This effect was decreased by the addition of tetracycline (0.1 μg/ml). Control cells exhibited nearly the same extent of neurite outgrowth, independent of the amount of tetracycline added to the cell culture medium while EGF had no effect on neurite outgrowth of cells expressing either PTP20 wt or C/S mutant.

EGF and NGF have opposing effects on PC12 cells. EGF as a mitogen stimulates the proliferation of PC12 cells, whereas NGF induces differentiation. As mentioned above, PTP20 wt stimulated the NGF-induced differentiation and preliminary data showed that overexpression of the PTP20 wt resulted in the inhibition of proliferation in serum-supplemented medium. This was clearly demonstrated using the tetracycline-controlled gene expression system. Overexpression of PTP20 wt resulted in approximately 50% reduction in [³ H]-thymidine incorporation of the PC12 cells upon serum or EGF stimulation, which was recovered by the addition of tetracycline (0.1 μg/ml). On the other hand, overexpression of PTP20 C/S mutant significantly accelerated the [³ H]-thymidine incorporation. These were well consistent with proliferation assay of PC12 cells grown in the serum-containing medium. Overexpression of PTP20 wt caused dramatic reduction in proliferation of the cells, whereas PTP20 C/S mutant slightly but significantly accelerated the growth of the cells.

Effect of Overexpression of PTP20 Wild-Type and C/S Mutant on MAP Kinase Activity upon EGF or NGF Stimulation

In PC12 cells, both EGF and NGF activate the p42 and p44 isoforms of MAP kinase. To relate the effects of overexpression of PTP20 wt and C/S mutant on NGF-induced differentiation and mitogen-promoted proliferation, the time course of MAP kinase activity was determined. Consistent with the above data, overexpression of PTP20 wt by removal of tetracycline resulted in higher and sustained MAP kinase activation upon NGF stimulation leading to the accelerated differentiation. This effect was diminished by the addition of tetracycline. In contrast, EGF-treated MAP kinase activation was inhibited by overexpressing PTP20 wt. This was comparable to the data that overexpression of v-Crk caused sustained MAP kinase activation leading to neurite outgrowth upon EGF stimulation (Hempstead et al., (1994) Mol. Cell. Biol., 14, 1964-1971). On the other hand, overexpression of PTP20 C/S mutant caused inhibition or promotion of MAP kinase activation upon NGF or EGF stimulation, respectively. These data suggest that PTP20 inhibit the proliferation of PC12 cells, allowing the cells to lead to accelerated NGF-induced differentiation.

Effect of Overexpression of PTP20 Wild-Type or C/S Mutant on Cellular Tyrosine Phosphorylation

To investigate the effect of overexpression of PTP20 wt or C/S mutant on cellular tyrosine phosphorylation, PC12 cells overexpressing either PTP20 wt or C/S mutant upon removal of tetracycline were stimulated with either EGF or NGF. When PTP20 wt was overexpressed, dramatic reduction in tyrosine phosphorylation was observed upon EGF stimulation. This was further investigated by precipitating specific molecules upon ligand stimulation. Overexpression of PTP20 wt resulted in dramatic dephosphorylation of EGF receptor and that tyrosine phosphorylated proteins with molecular masses of 130 kDa and 50 kDa were brought down with EGF receptor upon EGF stimulation only when PTP20 C/S mutant was overexpressed. 50 kDa protein was shown to be PTP20 C/S mutant by Western blotting using anti-PTP-PEST antibody. To confirm this, lysate was precipitated with Ni-TNA-agarose, which is known to associate specifically with histidine residues. Only PTP20 C/S mutant became tyrosine phosphorylated and EGF receptor and 130 kDa protein were co-precipitated. So far the 130 kDa protein has not been identified. Tyrosine phosphorylation level of SHC was also diminished by overexpression of PTP20 wt upon removal of tetracycline from the cell culture medium. In contrast, overexpression of PTP20 wt or C/S mutant did not seem to influence the tyrosine phosphorylation level of NGF receptor and SHC upon NGF stimulation and no physical association between NGF receptor and PTP20 wt or C/S mutant was observed.

One skilled in the art would readily appreciate that the present invention is well adapted to carry out the objects and obtain the ends and advantages mentioned, as well as those inherent therein. The molecular complexes and the methods, procedures, treatments, molecules, specific compounds described herein are presently representative of preferred embodiments are exemplary and are not intended as limitations on the scope of the invention. Changes therein and other uses will occur to those skilled in the art which are encompassed within the spirit of the invention are defined by the scope of the claims.

It will be readily apparent to one skilled in the art that varying substitutions and modifications may be made to the invention disclosed herein without departing from the scope and spirit of the invention.

All patents and publications mentioned in the specification are indicative of the levels of those skilled in the art to which the invention pertains. All patents and publications are herein incorporated by reference to the same extent as if each individual publication was specifically and individually indicated to be incorporated by reference.

The invention illustratively described herein suitably may be practiced in the absence of any element or elements, limitation or limitations which is not specifically disclosed herein. Thus, for example, in each instance herein any of the terms "comprising", "consisting essentially of" and "consisting of" may be replaced with either of the other two terms. The terms and expressions which have been employed are used as terms of description and not of limitation, and there is no intention that in the use of such terms and expressions of excluding any equivalents of the features shown and described or portions thereof, but it is recognized that various modifications are possible within the scope of the invention claimed. Thus, it should be understood that although the present invention has been specifically disclosed by preferred embodiments and optional features, modification and variation of the concepts herein disclosed may be resorted to by those skilled in the art, and that such modifications and variations are considered to be within the scope of this invention as defined by the appended claims.

In addition, where features or aspects of the invention are described in terms of Markush groups, those skilled in the art will recognize that the invention is also thereby described in terms of any individual member or subgroup of members of the Markush group. For example, if X is described as selected from the group consisting of bromine, chlorine, and iodine, claims for X being bromine and claims for X being bromine and chlorine are fully described.

Other embodiments are within the following claims.

    __________________________________________________________________________     #             SEQUENCE LISTING                                                   - -  - - (1) GENERAL INFORMATION:                                              - -     (iii)  NUMBER OF SEQUENCES: 7                                          - - (2) INFORMATION FOR SEQ ID NO:  1:                                         - -     (i)  SEQUENCE CHARACTERISTICS:                                         - -          (A)  LENGTH:     - #      2226 base pairs                                 (B)  TYPE:     - #        nucleic acid                                         (C)  STRANDEDNESS:   - #  single                                               (D)  TOPOLOGY:    - #     linear                                       - -     (ii)  MOLECULE TYPE: cDNA to tRNA                                      - -     (xi)  SEQUENCE DESCRIPTION: SEQ ID NO: - #  1:                         - -  GAATTCCGGC ACGAGGCGGG TTGCAGTATG AGTCGCCAAT CGGACCTAGT - #GAGGAGCTT     C    60                                                                          - -  TTGGAGCAGC AGGAGGCCCG GGACCACCGG AAGGGGGCAA TCCTCGCCCG - #TGAGTTCAG     C   120                                                                          - -  GACATTAAGG CCCGCTCAGT GGCTTGGAAG ACTGAAGGTG TGTGCTCCAC - #TAAAGCCGG     C   180                                                                          - -  AGTCAGCAGG GAAACTCAAA GAAGAACCGC TACAAAGACG TGGTACCGTA - #TGATGAGAC     G   240                                                                          - -  AGAGTCATCC TTTCCCTGCT CCAGGAGGAA GGACACGGAG ATTACATTAA - #TGCCAACTT     C   300                                                                          - -  ATCCGGGGCA CAGATGGAAG CCAGGCCTAC ATTGCGACGC AAGGACCCCT - #GCCTCACAC     T   360                                                                          - -  CTGTTGGACT TCTGGCGCCT GGTTTGGGAG TTTGGAATCA AGGTGATCTT - #GATGGCCTG     T   420                                                                          - -  CAGGAGACAG AAAATGGACG GAGGAAGTGT GAACGCTACT GGGCCCAGGA - #GCGGGAGCC     T   480                                                                          - -  CTACAGGCCG GGCCTTTCTG CATCACCCTG ACAAAGGAGA CAGCACTGAC - #TTCGGACAT     C   540                                                                          - -  ACTCTCAGGA CCCTCCAGGT TACATTCCAG AAGGAATCCC GTCCTGTGCA - #CCAGCTACA     G   600                                                                          - -  TACATGTCTT GGCCGGACCA CGGGGTTCCC AGCAGTTCCG ATCACATTCT - #CACCATGGT     G   660                                                                          - -  GAGGAGGCCC GTTGCCTCCA AGGACTTGGA CCTGGACCCC TCTGTGTCCA - #CTGCAGTGC     T   720                                                                          - -  GGCTGTGGAC GAACAGGTGT CTTGTGTGCT GTTGATTACG TGAGGCAGTT - #GCTTCTGAC     T   780                                                                          - -  CAGACAATCC CACCCAATTT CAGCCTCTTT GAAGTGGTCC TGGAGATGCG - #GAAACAGCG     A   840                                                                          - -  CCTGCAGCGG TGCAGACAGA GGAGCAGTAC AGGTTCCTGT ACCACACAGT - #GGCTCAGCT     A   900                                                                          - -  TTCTCCCGCA CTCTCCAGAA CAACAGTCCC CTCTACCAGA ACCTCAAGGA - #GAACCGCGC     T   960                                                                          - -  CCAATCTGCA AGGACTCCTC GTCCCTCAGG ACCTCCTCAG CCCTGCCTGC - #CACATCCCG     C  1020                                                                          - -  CCACTGGGTG GCGTTCTCAG GAGCATCTCG GTGCCTGGGC CACCGACCCT - #TCCCATGGC     T  1080                                                                          - -  GACACTTACG CTGTGGTGCA GAAGCGTGGC GCTTCCGGCA GCACAGGGCC - #GGGCACGCG     G  1140                                                                          - -  GCGCCCAACA GCACGGACAC CCCGATCTAC AGCCAGGTGG CTCCACGTAT - #CCAGCGGCC     C  1200                                                                          - -  GTGTCACACA CCGAAAACGC GCAGGGGACA ACGGCACTGG GCCGAGTTCC - #TGCGGATGA     A  1260                                                                          - -  AACCCTTCCG GGCCTGATGC CTATGAGGAA GTAACAGATG GAGCGCAGAC - #TGGTGGGCT     A  1320                                                                          - -  GGCTTCAACT TGCGCATTGG AAGACCTAAA GGGCCACGGG ATCCTCCAGC - #GGAGTGGAC     A  1380                                                                          - -  CGGGTGTAAT GAGTGCTGTA CCAGTTCCAG CCTGTCACTC AGTGGTGGCT - #GGGCGACTG     C  1440                                                                          - -  AACCCCCATG CTGCTGTGTG CTGTCTTATG TATGAGTGGG ACTCATGGGC - #CTGAATCAA     A  1500                                                                          - -  ATAAAAGTTT CTCAGGGTAG AAAAAAACAA ATAGGGACTT TGGCCAGTGG - #TTATAGCAG     T  1560                                                                          - -  CAAAGCCAGG GGCTAGGAGG GGTAAGTGGG GGAGGTGGTG GATCTACTCT - #GAGAAAGTT     T  1620                                                                          - -  AGGAAAGCAC ATCAAGAGTG AGCATCGCCA CTCTTCTCCC CATACACCTA - #CTGGAAAGT     G  1680                                                                          - -  CACCCCAGAC AGAGTCCTAA CTTGACAGTG CACCTCAGAC AGGTCGCTAC - #CTGGATGGA     C  1740                                                                          - -  ATGCTGGCCC TACAGCTAGA GACATGTCTA ATTAGATCCT CATGTAAACT - #TGCAATGAG     C  1800                                                                          - -  TAGAAAGATC TCCGTCTGGT CAGGGAAATG GATCACCTAG TCAGGTAAAT - #AGTGTGCCA     T  1860                                                                          - -  CCAGAAGACA GAACTGCAAG ATACCGTCTT TCTCAAAATG GAAGAAAATA - #GATCCTCAA     G  1920                                                                          - -  AATAAATGTA TGTACAATGC TCTACGCCCT GATCCTGCCC TGCCTCACTG - #CCATAATGT     C  1980                                                                          - -  ACAAACAAGT CAGGGTCTAT ATGACAGTTG TTCATCTAGT CAGTCCTGAC - #TGTGGCCTC     T  2040                                                                          - -  GCAGGCTCAG ATAGTGCCTT CTGCAGACTC TTGGAATGCC CGTCTTGAAC - #TTGATGAAA     G  2100                                                                          - -  CTTCTACCGG GAACTTGTAA ACATCATTAA AATTATTAAT GTAGAATTCA - #ATAAAGAGT     G  2160                                                                          - -  GGTCAAAAAC TCAAAAAAAA AAAAAAAAAA AAAAAAAAAC TCGAGAGTAC - #TTCTAGAGC     G  2220                                                                          - -  GGCGGG                - #                  - #                  -       #         2226                                                                    - -  - - (2)  INFORMATION FOR SEQ ID NO:  2:                                   - -     (i)  SEQUENCE CHARACTERISTICS:                                         - -           (A)  LENGTH:    - #       6 amino acids                                   (B)  TYPE:    - #         amino acid                                           (C)  STRANDEDNESS:  - #   single                                               (D)  TOPOLOGY:   - #      linear                                      - -     (ii)  MOLECULE TYPE: peptide                                              (ix)  FEATURE:                                                              - -            (D)  OTHER INFORMAT - #ION:  "Xaa" in positions 3 and 5      stand                                                                                             - #                  - # for an unspecified amino           acid.                                                                             - -     (xi)  SEQUENCE DESCRIPTION: SEQ ID NO: - #  2:                         - -  Phe Trp Xaa Met Xaa Trp                                                    1               5                                                             - - (2)  INFORMATION FOR SEQ ID NO:  3:                                        - -           (i)  SEQUENCE CHARACTER - #ISTICS:                               - -           (A)  LENGTH:    - #       7 amino acids                                   (B)  TYPE:    - #         amino acid                                           (C)  STRANDEDNESS:  - #   single                                               (D)  TOPOLOGY:   - #      linear                                      - -     (ii)  MOLECULE TYPE:   peptide                                         - -     (ix)  FEATURE:                                                         - -           (D)  OTHER INFORMATION: - #   "Xaa" in position 6 stands      for                                                                                            either Se - #r, Ile or Val.                                      - -     (xi)  SEQUENCE DESCRIPTION: SEQ ID NO: - #  3:                         - -  His Cys Ser Ala Gly Xaa Gly                                                1               5                                                             - -  - - (2)  INFORMATION FOR SEQ ID NO:  4:                                   - -      (i)  SEQUENCE CHARACTERISTICS:                                        - -           (A)  LENGTH:    - #       27 base pairs                                   (B)  TYPE:    - #         nucleic acid                                         (C)  STRANDEDNESS:  - #   single                                               (D)  TOPOLOGY:   - #      linear                                      - -     (xi)  SEQUENCE DESCRIPTION: SEQ ID NO: - #  4:                         - -  CTCTGTGTCC ACAGCAGTGC TGGCTGT         - #                  - #                  27                                                                       - - (2)  INFORMATION FOR SEQ ID NO:  5:                                        - -      (i)  SEQUENCE CHARACTERISTICS:                                        - -           (A)  LENGTH:    - #       59 base pairs                                   (B)  TYPE:    - #         nucleic acid                                         (C)  STRANDEDNESS:  - #   single                                               (D)  TOPOLOGY:   - #      linear                                      - -     (xi)  SEQUENCE DESCRIPTION: SEQ ID NO: - #  5:                         - -  ATGAATTCGC CACCATGCAT CACCATCACC ATCACAGTCG CCAATCGGAC - #CTAGTGAGG          59                                                                          - -  - - (2)  INFORMATION FOR SEQ ID NO:  6:                                   - -      (i)  SEQUENCE CHARACTERISTICS:                                        - -           (A)  LENGTH:    - #       38 base pairs                                   (B)  TYPE:    - #         nucleic acid                                         (C)  STRANDEDNESS:  - #   single                                               (D)  TOPOLOGY:   - #      linear                                      - -     (xi)  SEQUENCE DESCRIPTION: SEQ ID NO: - #  6:                         - -  ATCTCGAGTT ACACCCGTGT CCACTCCGCT GGAGGATC     - #                       - #     38                                                                       - -  - - (2)  INFORMATION FOR SEQ ID NO:  7:                                   - -      (i)  SEQUENCE CHARACTERISTICS:                                        - -           (A)  LENGTH:    - #       453 amino acids                                 (B)  TYPE:    - #         amino acid                                           (C)  STRANDEDNESS:  - #   single                                               (D)  TOPOLOGY:   - #      linear                                      - -     (ii)  MOLECULE TYPE:       - # peptide                                 - -     (xi)  SEQUENCE DESCRIPTION: SEQ ID NO: - #  7:                         - -  Met Ser Arg Gln Ser Asp Leu Val Arg Ser - #Phe Leu Glu Gln Gln Glu         1               5 - #                 10 - #                 15               - -  Ala Arg Asp His Arg Lys Gly Ala Ile Leu - #Ala Arg Glu Phe Ser Asp                    20     - #             25     - #             30                   - -  Ile Lys Ala Arg Ser Val Ala Trp Lys Thr - #Glu Gly Val Cys Ser Thr                35         - #         40         - #         45                       - -  Lys Ala Gly Ser Gln Gln Gly Asn Ser Lys - #Lys Asn Arg Tyr Lys Asp            50             - #     55             - #     60                           - -  Val Val Pro Tyr Asp Glu Thr Arg Val Ile - #Leu Ser Leu Leu Gln Glu        65                 - # 70                 - # 75                 - # 80        - -  Glu Gly His Gly Asp Tyr Ile Asn Ala Asn - #Phe Ile Arg Gly Thr Asp                        85 - #                 90 - #                 95               - -  Gly Ser Gln Ala Tyr Ile Ala Thr Gln Gly - #Pro Leu Pro His Thr Leu                    100     - #            105     - #            110                  - -  Leu Asp Phe Trp Arg Leu Val Trp Glu Phe - #Gly Ile Lys Val Ile Leu                115         - #        120         - #        125                      - -  Met Ala Cys Gln Glu Thr Glu Asn Gly Arg - #Arg Lys Cys Glu Arg Tyr            130             - #    135             - #    140                          - -  Trp Ala Gln Glu Arg Glu Pro Leu Gln Ala - #Gly Pro Phe Cys Ile Thr        145                 - #150                 - #155                 -          #160                                                                              - -  Leu Thr Lys Glu Thr Ala Leu Thr Ser Asp - #Ile Thr Leu Arg Thr         Leu                                                                                               165 - #                170 - #                175             - -  Gln Val Thr Phe Gln Lys Glu Ser Arg Pro - #Val His Gln Leu Gln Tyr                    180     - #            185     - #            190                  - -  Met Ser Trp Pro Asp His Gly Val Pro Ser - #Ser Ser Asp His Ile Leu                195         - #        200         - #        205                      - -  Thr Met Val Glu Glu Ala Arg Cys Leu Gln - #Gly Leu Gly Pro Gly Pro            210             - #    215             - #    220                          - -  Leu Cys Val His Cys Ser Ala Gly Cys Gly - #Arg Thr Gly Val Leu Cys        225                 - #230                 - #235                 -          #240                                                                              - -  Ala Val Asp Tyr Val Arg Gln Leu Leu Leu - #Thr Gln Thr Ile Pro         Pro                                                                                               245 - #                250 - #                255             - -  Asn Phe Ser Leu Phe Glu Val Val Leu Glu - #Met Arg Lys Gln Arg Pro                    260     - #            265     - #            270                  - -  Ala Ala Val Gln Thr Glu Glu Gln Tyr Arg - #Phe Leu Tyr His Thr Val                275         - #        280         - #        285                      - -  Ala Gln Leu Phe Ser Arg Thr Leu Gln Asn - #Asn Ser Pro Leu Tyr Gln            290             - #    295             - #    300                          - -  Asn Leu Lys Glu Asn Arg Ala Pro Ile Cys - #Lys Asp Ser Ser Ser Leu        305                 - #310                 - #315                 -          #320                                                                              - -  Arg Thr Ser Ser Ala Leu Pro Ala Thr Ser - #Arg Pro Leu Gly Gly         Val                                                                                               325 - #                330 - #                335             - -  Leu Arg Ser Ile Ser Val Pro Gly Pro Pro - #Thr Leu Pro Met Ala Asp                    340     - #            345     - #            350                  - -  Thr Tyr Ala Val Val Gln Lys Arg Gly Ala - #Ser Gly Ser Thr Gly Pro                355         - #        360         - #        365                      - -  Gly Thr Arg Ala Pro Asn Ser Thr Asp Thr - #Pro Ile Tyr Ser Gln Val            370             - #    375             - #    380                          - -  Ala Pro Arg Ile Gln Arg Pro Val Ser His - #Thr Glu Asn Ala Gln Gly        385                 - #390                 - #395                 -          #400                                                                              - -  Thr Thr Ala Leu Gly Arg Val Pro Ala Asp - #Glu Asn Pro Ser Gly         Pro                                                                                               405 - #                410 - #                415             - -  Asp Ala Tyr Glu Glu Val Thr Asp Gly Ala - #Gln Thr Gly Gly Leu Gly                    420     - #            425     - #            430                  - -  Phe Asn Leu Arg Ile Gly Arg Pro Lys Gly - #Pro Arg Asp Pro Pro Ala                435         - #        440         - #        445                      - -  Glu Trp Thr Arg Val                                                           450                                                                      __________________________________________________________________________ 

What is claimed is:
 1. An isolated, or purified polypeptide, wherein said polypeptide comprises at least 100 contiguous amino acids of the amino acid sequence set forth in SEQ ID NO:7.
 2. The polypeptide of claim 1, wherein said polypeptide comprises at least 200 contiguous amino acids of the amino acid sequence set forth in SEQ ID NO:7.
 3. The polypeptide of claim 1, wherein said polypeptide comprises at least 300 contiguous amino acids of the amino acid sequence set forth in SEQ ID NO:7.
 4. The polypeptide of claim 1, wherein said polypeptide comprises at least 400 contiguous amino acids of the amino acid sequence set forth in SEQ ID NO:7.
 5. An isolated, or purified polypeptide comprising the amino acid sequence set forth in SEQ ID NO:7.
 6. An isolated, or purified polypeptide that differs from the polypeptide having the amino acid sequence of SEQ ID NO:7 in that it lacks one or two of the following segments of amino acid residues: 1-58, 59-294, or 295-453.
 7. An isolated, or purified polypeptide comprising one or more of the following segments of amino acid residues of SEQ ID NO:7: 1-58, 59-294, or 295-453.
 8. A preparation containing a polypeptide consisting of the amino acid sequence set forth in SEQ ID No:6, where in said preparation is enriched in said polypeptide at least two fold relative to its content in either normal or diseased cells.
 9. A preparation containing a polypeptide comprising at least 100 contiguous amino acids of the amino acid sequence set forth in SEQ ID No:6, where in said preparation is enriched in said polypeptide least two fold relative to its content in either normal or diseased cells.
 10. A preparation containing a polypeptide comprising at least 100 contiguous amino acids of the amino acid sequence set forth in SEQ ID No:6, where in said preparation is enriched in said polypeptide at least two fold relative to its content in either normal or diseased cells, said polypeptide lacking at least one of the following segments of amino acids from SEQ ID No:6: 1-58, 59-294 and 295-453.
 11. A preparation containing a polypeptide consisting at least one of the following segments of amino acid residues of SEQ ID No:6: 1-58, 59-294 or 295-453, where in said preparation is enriched in said polypeptide at least two fold relative to its content in either normal or diseased cells. 